Improving Human Pose Recognition Accuracy using CRF modeling
نویسندگان
چکیده
Interest in robotics in the domain of manufacturing industry has shown an outstanding growth recently in scenarios where human beings and robots are present simultaneously. Humans and robots often share the same workspace and this poses a lot of threats to the human safety issues [1] e.g. in manufacturing industry, in automobile industry where automobile components are integrated, in medical industry where minimally-invasive-surgery is facilitated and so on. In the proposed approach, segmentation is defined as a classification task and is used for pixelwise object class labeling of human body-parts. Depth measurements from a KINECT RGB-D ceiling sensor are obtained in order to do the pixelwise object class labeling. The ultimate intended use is in the safe human-robot collaboration (SHRC) and interaction (SHRI) domains for challenging domestic and industrial environments. Within this scope, a pairwise conditional random field (CRF) approach is used for labeling. CRF is formulated in terms of an energy minimization (EM) problem while an efficient random decision forest (RDF) is used for classification. In [4], we show how an RDF classifier is used for pixelwise classification of human body-parts using deph data. We found that there exists misclassification of labels assigned to each pixel and that should be minimized for feasible and practical human-robot cooperation. This work builds on top of our previous work Dittrich et al. [4] in order to improve recognizing human body-parts.
منابع مشابه
Latent Pose Estimator for Continuous Action Recognition
Recently, models based on conditional random fields (CRF) have produced promising results on labeling sequential data in several scientific fields. However, in the vision task of continuous action recognition, the observations of visual features have dimensions as high as hundreds or even thousands. This might pose severe difficulties on parameter estimation and even degrade the performance. To...
متن کاملA Practical Activity Recognition Approach Based on the Generic Activity Framework
In spite of the obvious importance of activity recognition technology for human centric applications, stateof-the-art activity recognition technology is not practical enough for real world deployments because of the insufficient accuracy and lack of support for programmability. The authors introduce a generic activity framework to address these issues. The generic activity framework is a refine...
متن کاملA Probabilistic Approach to Persian Ezafe Recognition
In this paper, we investigate the problem of Ezafe recognition in Persian language. Ezafe is an unstressed vowel that is usually not written, but is intelligently recognized and pronounced by human. Ezafe marker can be placed into noun phrases, adjective phrases and some prepositional phrases linking the head and modifiers. Ezafe recognition in Persian is indeed a homograph disambiguation probl...
متن کاملActivity Recognition Using Biomechanical Model Based Pose Estimation
In this paper, a novel activity recognition method based on signal-oriented and model-based features is presented. The model-based features are calculated from shoulder and elbow joint angles and torso orientation, provided by upper-body pose estimation based on a biomechanical body model. The recognition performance of signal-oriented and model-based features is compared within this paper, and...
متن کاملAutomatic Modeling and Localization for Object Recognition
Being able to accurately estimate an object’s pose (location) in an image is important for practical implementations and applications of object recognition. Recognition algorithms often trade off accuracy of the pose estimate for efficiency—usually resulting in brittle and inaccurate recognition. One solution is object localization—a local search for the object’s true pose given a rough initial...
متن کامل